Correlation Clustering and Biclustering with Locally Bounded Errors

Authors

  • Gregory J. Puleo
  • Olgica Milenkovic
Abstract

We consider a generalized version of the correlation clustering problem, defined as follows. Given a complete graph G whose edges are labeled with + or −, we wish to partition the graph into clusters while trying to avoid errors: + edges between clusters or − edges within clusters. Classically, one seeks to minimize the total number of such errors. We introduce a new framework that allows the objective to be a more general function of the number of errors at each vertex (for example, we may wish to minimize the number of errors at the worst vertex) and provide a rounding algorithm which converts “fractional clusterings” into discrete clusterings while causing only a constant-factor blowup in the number of errors at each vertex. This rounding algorithm yields constant-factor approximation algorithms for the discrete problem under a wide variety of objective functions.
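As a minimal sketch of the error definition in the abstract (not the authors' rounding algorithm), the following Python snippet counts the errors at each vertex for a given clustering of a complete signed graph, then evaluates the classical objective (total errors) and the worst-vertex objective. The function name `vertex_errors`, the data layout, and the toy graph are assumptions made for illustration only.

```python
from itertools import combinations


def vertex_errors(labels, clusters):
    """Count the errors incident to each vertex.

    labels:   dict mapping frozenset({u, v}) -> '+' or '-' for every pair
              (hypothetical layout; the graph is assumed complete)
    clusters: dict mapping vertex -> cluster id
    """
    errors = {v: 0 for v in clusters}
    for u, v in combinations(sorted(clusters), 2):
        same_cluster = clusters[u] == clusters[v]
        sign = labels[frozenset((u, v))]
        # An error is a '+' edge between clusters or a '-' edge within one.
        if (sign == '+' and not same_cluster) or (sign == '-' and same_cluster):
            errors[u] += 1
            errors[v] += 1
    return errors


# Toy instance: a path a-b-c-d of '+' edges, every other pair labeled '-'.
labels = {frozenset(p): '-' for p in combinations("abcd", 2)}
for p in [("a", "b"), ("b", "c"), ("c", "d")]:
    labels[frozenset(p)] = '+'

clusters = {"a": 0, "b": 0, "c": 1, "d": 1}
errs = vertex_errors(labels, clusters)

total_errors = sum(errs.values()) // 2  # each erroneous edge is counted at both ends
worst_vertex = max(errs.values())       # the "worst vertex" objective
```

Both quantities are functions of the same per-vertex error vector, which is the kind of generalization the abstract describes: different objectives correspond to different functions applied to `errs`.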


Related resources

Minimax Correlation Clustering and Biclustering: Bounding Errors Locally

We introduce a new agnostic clustering model, minimax correlation clustering, and a rounding algorithm tailored to the needs of this model. Given a graph whose edges are labeled with + or −, we wish to partition the graph into clusters while trying to avoid errors: + edges between clusters or − edges within clusters. Unlike classical correlation clustering, which seeks to minimize the total num...


Bi-correlation clustering algorithm for determining a set of co-regulated genes

MOTIVATION Biclustering has emerged as a powerful tool for identifying a group of co-expressed genes under a subset of experimental conditions (measurements) present in a gene expression dataset. Several biclustering algorithms have been proposed to date. In this article, we address some of the important shortcomings of these existing biclustering algorithms and propose a new corre...


Biclustering as a method for RNA local multiple sequence alignment

MOTIVATIONS Biclustering is a clustering method that simultaneously clusters both the domain and range of a relation. A challenge in multiple sequence alignment (MSA) is that the alignment of sequences is often intended to reveal groups of conserved functional subsequences. Simultaneously, the grouping of the sequences can impact the alignment; precisely the kind of dual situation biclustering ...


Gene Expression Data Knowledge Discovery using Global and Local Clustering

To understand complex biological systems, the research community has produced a huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting important biological knowledge remains difficult. To address this task, clustering techniques are used. In this paper, hybrid Hierarchical k-Means algorithm is use...


ON LOCAL BOUNDEDNESS OF I-TOPOLOGICAL VECTOR SPACES

The notion of generalized locally bounded $I$-topological vector spaces is introduced. Some of their important properties are studied. The relationship between this kind of spaces and the locally bounded $I$-topological vector spaces introduced by Wu and Fang [Boundedness and locally bounded fuzzy topological vector spaces, Fuzzy Math. 5 (4) (1985) 87$-$94] is discussed. Moreover, we also use the fam...




Publication date: 2016